4 The Application of the 
Quantitative Paradigm to 
Historical Sociolinguistics: 
Problems with the 
Generalizability Principle 

JUAN M. HERNANDEZ-CAMPOY AND 
NATALIE SCHILLING 


Historical sociolinguistics has often been considered to suffer, perhaps inevitably,
from lack of representativeness and validity of its findings. This is because the
sociolinguistic study of historical language forms must rely on linguistic records
from previous periods - most of which will be incomplete or non-representative
in some way - as well as on knowledge and understanding of past sociocultural
situations that can only be reconstructed rather than directly observed or
experienced by the researcher. In this chapter, we mention the seven main problems we
have to contend with when trying to practice historical sociolinguistic research: i)
representativeness, ii) empirical validity, iii) invariation, iv) authenticity, v)
authorship, vi) social and historical validity, and vii) standard ideology. But theoretical
and procedural problems are also present even in the apparently rigorous
methodology of variationist sociolinguistics, as evidenced, for example, in the unevenness
with which studies conform or fail to conform to the generalizability principle, as
well as the extent to which 'generalizability' in sociolinguistic studies is even
possible. Indeed, as Bailey and Tillery (2004: 13) note, "without a body of research
that examines the effects of methods on results," we cannot even know how
generalizable (i.e. representative and reliable) our studies have been or may be.
Hence, we cannot hold historical sociolinguistics to standards with which
sociolinguistics itself cannot comply. As Schneider (2002) states, despite its
limitations, historical sociolinguistics is not a second-best solution by inevitable
necessity, but just the best solution in those areas of study for which oral records are
not available, especially when studying long-term developments of language
variation and change. In fact, in both approaches (diachronic and synchronic
sociolinguistics) the goal of illuminating the processes of language change is exactly
the same and the pathways to it are very similar. In Nevalainen and
Raumolin-Brunberg's (2003: 26) words: "[t]rue, historical data can be characterized as 'bad'
in many ways, but we would rather place the emphasis on making the best use
of the data available."


The Handbook of Historical Sociolinguistics, First Edition. Edited by Juan Manuel Hernandez-Campoy,
Juan Camilo Conde-Silvestre.

© 2012 Blackwell Publishing Ltd. Published 2012 by Blackwell Publishing Ltd.

64 Methods for the Sociolinguistic Study of the History of Languages


1. Sociolinguistics and the Quantitative Revolution 

Schools of thought and historical periods cannot be treated as discrete entities, 
with a monolithic nature and abrupt boundaries, since they build upon, and often 
begin as reactions against, preceding stages. Even the same phenomena are
differently emphasized by different schools, and different aspects of a single theory
may be highlighted by one school and downplayed or neglected by another. This 
means, as Lass (1984: 8) suggests, that 

the history of any discipline involves a lot of old wine in new bottles (as well as new 
wine in old bottles, new wine in new bottles, and some old wine left in the old 
bottles). Even ideas that seem at the moment self-evidently true do not arise out of 
nowhere, but are the products of a long series of trial-and-error interim solutions to 
perennial problems, illuminated by occasional flashes of creative insight and inspired 
invention. Improvements or even radical restructuring of a theory does not (or 
should not) imply the rejection of everything that went before. 

Kuhn (1962) was in fact very concerned about this nature of science in his thesis 
on scientific revolutions and the emergence of new paradigms. Linguistic theory, 
as Williams (1992: 40) points out, "has not emerged separately from the social 
philosophy of its time. Rather, it must be seen as a manifestation of the ongoing 
debate on the nature of society and the social world." In this way, while the
nineteenth-century philologist, historicist, and comparatist urge in the study of the
nature of language was in overt opposition to the humanism and classicism of the
Renaissance and to seventeenth-century rationalism, the structuralist turn at the
beginning of the twentieth century was an alternative to nineteenth-century
historicism and comparativism. Even within the same period, different theoretical
trends have followed one another in linguistics, such as the structuralism,
functionalism, and generativism of the twentieth century.

In many ways, sociolinguistics also emerged as a kind of reaction against
previous paradigms. Different motivations favored the development of this field of
study: i) the dissatisfaction among many linguists in the 1960s with previous
paradigms (e.g. those of Chomsky and Saussure); ii) the redefinition and
reformulation of traditional dialectology; iii) the growing interest among linguists in
sociology and its scope; and iv) the quantitative revolution. Focusing on this last
factor, sociolinguists in the Labovian tradition have sought to adopt scientific
methods, to develop a quantitative linguistic perspective in which social, cultural,
and contextual factors can be used to describe, explain, and predict patterns of
linguistic variation and change (though of course variationists in recent years have
stressed that mere correlations of social factors with linguistic phenomena are not
in themselves explanations for these phenomena; see, among others, Eckert 2000;
Bayley 2002: 134-36).

The principles of representativeness and generalizability were initially held as
fundamental in the methodological rigor of variationist sociolinguistic procedure.
Indeed, they are still often regarded as ideals, though sociolinguists have long
recognized that achieving genuine statistical representativeness is extremely
difficult and indeed is not always necessary to obtain solid results and revealing
insights (see Labov 1966; Sankoff 1980: 51-52; Milroy and Gordon 2003: 28-29).
As far as representativeness is concerned, all members of the community must
have an equal chance of being selected as representative informants; further,
informants should be selected preserving the sociological and demographic
characteristics of the entire population. With regard to 'generalizability', any
sociolinguistic research, according to Wolfram (2004), has to fulfill two particular
criteria in order to produce results that can accurately be generalized to the
behavior of the entire population, or speech community: in Bailey and Tillery's (2004:
1) words, "reliability (i.e., that the same results would be obtained in repeated
observations of the same phenomenon) and intersubjectivity (i.e., that two different
researchers observing the same phenomenon would have obtained the same
results)."

For all this, Labov's (1982a: 30) Principle of Accountability is crucially important:
"all occurrences of a given variant are noted, and where it has been possible to
define the variable as a closed set of variants, all non-occurrences of the variant in the
relevant circumstances." That is: "reports of the occurrence of a variant of a
linguistic variable must be accompanied by reports of its non-occurrence" (Trudgill
2003: 3).
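
The accountability principle can be illustrated with a minimal sketch. It assumes a hypothetical list of coded tokens for a single variable, in which every relevant context is recorded whether or not the variant of interest appears. The function name, the variant labels, and the counts are invented for illustration and are not data from any study cited here.

```python
from collections import Counter


def variant_rates(tokens):
    """Return each variant's share of ALL relevant contexts for one variable.

    `tokens` holds one variant label per context in which the variable
    could occur, so occurrences and non-occurrences of a given variant
    are counted against the same denominator.
    """
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no relevant contexts were coded")
    return {variant: n / total for variant, n in counts.items()}


# Hypothetical coding of one writer's tokens (illustrative numbers only).
coded = ["standard"] * 6 + ["non-standard"] * 18
for variant, rate in variant_rates(coded).items():
    print(f"{variant}: {rate:.0%}")
```

Reporting only the six 'standard' tokens would say nothing about how often that variant could have occurred; the shared denominator of 24 contexts is what makes the figures comparable across writers.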


2. Historical Sociolinguistics and the Difficulties of 
Reconstructing the Past 

Chapters 1 and 2 in this Handbook have underlined the hybrid nature of historical
sociolinguistics as the convergence of historical linguistics and sociolinguistics
(see also Conde-Silvestre 2007: 19-27). Assuming that the evolution of linguistic
and social systems always occurs in relation to the sociohistorical situations of
their speakers and taking into account the tenet that the past should be studied
in order to understand and explain the present (and vice versa), Romaine (1982,
1988) proposed this multidisciplinary field. Theoretically, its main objective
is: "to investigate and provide an account of the forms and uses in which variation
may manifest itself in a given speech community over time, and of how particular
functions, uses and kinds of variation develop within particular languages, speech
communities, social groups, networks and individuals" (1988: 1453).
Methodologically, "the main task of socio-historical linguistics is to develop a set of
procedures for the reconstruction of language in its social context, and to use the
findings of sociolinguistics as controls on the process of reconstruction and as a
means of informing theories of change" (Romaine 1988: 1453). Since the
database for historical sociolinguistic analysis consists of written texts of all kinds
from earlier periods, the production of computer-readable corpora has
undoubtedly facilitated the development of research (see Chapter 6 in this Handbook).
The number of digitized corpora now available to historical
sociolinguists is a sign of the fruitful relation between linguistics, history, and information
technology and of their burgeoning growth in recent decades.

Schneider (2002) and Bauer (2002) highlight the potential and problems (pros 
and cons) of relying on written sources and public corpora (large collections of 
naturally occurring language data, including those that are computer-readable/ 
searchable) as linguistic data for variationist analysis. The most important
disadvantage of datasets of historical documents is that they very often lack
representativeness and possibly also validity, since, as noted above, the historical record is
incomplete, and written materials may or may not be reflective of the spoken 
language of the time period under study. As Labov (1972: 98) aptly notes, "[t]exts 
are produced by a series of historical accidents; amateurs may complain about 
this predicament, but the sophisticated historian is grateful that anything has 
survived at all. The great art of the historical linguist is to make the best of this 
bad data, 'bad' in the sense that it may be fragmentary, corrupted, or many times 
removed from the actual productions of native speakers." 

Nevalainen and Raumolin-Brunberg (Raumolin-Brunberg 1996; Nevalainen 
and Raumolin-Brunberg 1998; Nevalainen 1999) and Ayres-Bennett (2001) have 
underlined differences and similarities between historical sociolinguistics and 
variationist sociolinguistics, with special attention paid to the methodological 
problems associated with practicing the diachronic approach of the former. 
Complementing their review, we mention the seven main problems confronting 
those who practice historical sociolinguistic research, namely: i)
representativeness, ii) empirical validity, iii) invariation, iv) authenticity, v) authorship, vi) social
and historical validity, and vii) standard ideology. 

2.1 Representativeness 

As noted above, the data used in historical sociolinguistic study often suffer from 
lack of representativeness. Echoing Labov (1972), Milroy and Gordon (2003: 177) 
note that "data are often patchy as a consequence of the random preservation of 
some texts and the equally random loss of others" (see also Schneider 2002: 81-90). 
Hence, researchers must be careful to assess what types of speaker, population 
segments, and forms of language their samples encompass, since of course they 
cannot re-engineer pre-existing datasets. 





2.2 Empirical validity 

The fact that historical sociolinguists must work with text collections whose size
is inevitably limited, even those compiled into large computerized corpora, means
that we are often limited in terms of quantitative analysis and associated statistical
measures (Schneider 2002: 89). The experience of researchers in sociolinguistics
suggests that the number of tokens per cell of dependent variables has to be at
least 15 and, ideally, 30 or more, to ensure statistical representativeness. Further,
following general social-scientific standards, Labov (1966) suggested the need to
use 0.025% of the population to ensure representativeness in terms of independent
social factors.
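
These two rules of thumb can be sketched as simple checks. The thresholds (15 and 30 tokens per cell) and the 0.025% sampling fraction are the figures quoted in the text; the function names, the cell labels, and all counts below are hypothetical.

```python
MIN_TOKENS = 15    # minimum tokens per cell quoted in the text
IDEAL_TOKENS = 30  # preferred tokens per cell quoted in the text


def classify_cells(cell_counts):
    """Label each cell 'too few', 'adequate', or 'ideal' by its token count."""
    labels = {}
    for cell, n in cell_counts.items():
        if n < MIN_TOKENS:
            labels[cell] = "too few"
        elif n < IDEAL_TOKENS:
            labels[cell] = "adequate"
        else:
            labels[cell] = "ideal"
    return labels


def informants_needed(population):
    """Informants implied by a 0.025% sample (25 per 100,000), rounded up.

    Integer arithmetic avoids floating-point rounding surprises.
    """
    return (population * 25 + 99_999) // 100_000


# Hypothetical token counts per (social rank, variable) cell.
cells = {
    ("upper gentry", "(sh)"): 42,
    ("lower gentry", "(sh)"): 19,
    ("merchants", "(sh)"): 7,
}
print(classify_cells(cells))
print(informants_needed(120_000))
```

In a historical corpus the analyst cannot collect more tokens to rescue an underfilled cell, so a check of this kind mainly signals where cells may need to be merged or conclusions hedged.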

In addition to limitations in linguistic data, there are also limitations in the
social data we can obtain in studying past communities. Raumolin-Brunberg
(1996) summarizes some of the differences in data, foci, and nature of the
conclusions that can be drawn in studying present-day vs. past communities. These are
presented here in Table 4.1.

As we can see from the table, not only are we faced with limited linguistic data,
in terms of type, amount, and coverage, but also with more limited sociocultural
data than can be obtained in the study of current communities (see section 2.6
below for more on the latter issue).


Table 4.1. Sociolinguistic research on current and historical data (from
Raumolin-Brunberg 1996: 18; Reproduced by permission of Editions Rodopi B.V.)

                                 Present-day                   Past

Object of investigation          Phonological variation        Grammatical variation
                                 and/or change                 and/or change
Research material                Spoken language               Written language
                                 All people                    Only literate people (upper
                                                               ranks, men)
                                 Authentic speech:             Randomly preserved texts
                                 observation, elicitation,
                                 evaluation
Social context                   Society familiar, much        Social structure to be
                                 data available                reconstructed on the basis
                                                               of historical research
Standardization                  Significant element           Significance varies
Associated discipline            Sociology                     Social history
Length and result of the change  Unknown                       Known







2.3 Invariation 

The kind of research materials available in the sociolinguistic study of historical
vs. present-day communities (written vs. spoken) inevitably affects the object of
investigation: grammatical vs. phonological variation and/or change. And given
that written language tends to be more conservative, normative, and formal than
oral language, the fact of having to rely on written sources constrains the
probability of variation (Dees 1971; Arnaud 1980; Roberts and Street 1997; Conde-Silvestre
and Hernandez-Campoy 1998: 109-10). Hence, we must be careful that we do not
overstate the extent of apparent uniformity or the firmness of our conclusions
regarding the patterning of the variability we do observe, since in all likelihood
there was more variability, perhaps with different patterns, in older
spoken language than in the written records that remain. Further, the variation
we do find across manuscripts, if any, may be due either to dialectal or other
demographic/sociological differences or to stylistic differences across speakers or
across time periods (Toon 1976, 1983; Conde-Silvestre and Hernandez-Campoy
1998: 109-10).

2.4 Authenticity (purity in texts) 

Another important fact is that "the linguistic forms in such documents are often 
distinct from the vernacular of the writers, and instead reflect efforts to capture a 
normative dialect that never was any speaker's native language. As a result, many 
documents are riddled with the effects of hypercorrection, dialect mixture, and 
scribal error" (Labov 1994: 11). The transmission of manuscripts also involves 
problems, since many of them were copied from earlier ones which are now lost, 
and in many of them the original dialect may be mixed up with that of the scribe. 
Manuscripts also show intra-textual variation - that is, different portions of
manuscripts appear to be written in different dialects (see Stockwell and Barritt 1951,
1961; Conde-Silvestre and Hernandez-Campoy 1998: 110; also Chapter 7 in this 
Handbook). As a result, "the relationship between data derived from various kinds 
of written source and the data of spoken interaction which forms the basis of much 
contemporary sociolinguistic work is unclear" (Milroy and Gordon 2003: 177). Of 
course, sociolinguists have increasingly been questioning traditional notions of 
'authentic' language data, since all speech communities are subject to dialect (and 
language) contact and mixing, and everyday speech is fraught with self-conscious 
usages of various sorts, including hypercorrections, as well as stylistic variation 
that can render it impossible to locate a given speaker's 'genuine' vernacular style 
(see Schilling-Estes 2004). 

2.5 Authorship 

Especially in the case of private correspondence, letters might not have been
autographs (hand-written personally by the author) but may instead have been
written by an amanuensis (a helper writing from dictation, normally a family clerk
or chaplain) due to the widespread illiteracy characteristic of early historical time
periods (Hernandez-Campoy and Conde-Silvestre 1999; Schneider 2002: 76). For
example, in their study of the Middle English Paston Letters (Davis 1971),
Hernandez-Campoy and Conde-Silvestre (1999) found that they had to exercise
great care in interpreting patterns of variation found in the writings of one female
informant, Margaret Paston (?1400-79). Her writings were, unexpectedly, the most
non-standard of the entire collection (which included fifteen members of the
Paston family) and hence seemingly divergent from contemporary patterns for
the sociolinguistic behavior of women in the current Western industrialized world,
at least as noted in very general terms. However, there is external evidence (Davis
1971: xxv; Bergs 2005: 79) strongly suggesting that Margaret did not write the
letters herself, but that the family clerk and chaplain - James Gloys - or other scribes
connected to the family did so for her. In fact, Bergs' (2005: 79-80) analysis
suggests that dictation would affect only phonological or graphological variables,
not morpho-syntactic ones. Hence, it would be erroneous to conclude from the
analysis that women of Margaret Paston's time were actually less standard in their
writing (and possibly speech) than men. Figure 4.1 shows the extent of Margaret
Paston's use of vernacular (87%) vs. incipient standard (13%) spelling features -
(sh), (wh) and (u) - as compared with three male family members (William Paston
I, Clement Paston and John Paston II).


[Figure 4.1: bar chart titled "The Paston Family"; horizontal axis: Informants; bars: % Standard vs. % Non-Standard]

Figure 4.1. Percentage usage of the incipient standard in the Paston Family (from
Hernandez-Campoy and Conde-Silvestre 1999: 261; Reproduced by permission of
Editum)






































Table 4.2. Present-day social class groups in the Western world

Status                  Profession

Upper class             Executives, directors, owners, people with inherited
                        wealth
Middle-Middle Class     Professional people: executive managers, employers,
                        bank clerks, insurance workers, school teachers
Lower-Middle Class      Semi-professional people or non-manual workers: lower
                        managers, typists, commercial travelers, office workers
Upper-Working Class     Foremen, skilled manual workers, clerks
Middle-Working Class    (Semi-skilled) manual workers
Lower-Working Class     Laborers, unskilled laborers, seasonal workers


2.6 Social and historical validity: historical and 
socio-cultural background 

Given that "we usually know very little about the social position of the writers,
and not much more about the social structure of the community" (Labov 1994:
11), the task of "reconstructing the social information needed to interpret patterns
of variation in written texts is not always straightforward" (Milroy and Gordon
2003: 177). If we compare Labov's (1966) or Trudgill's (1974) social class groups
(see Table 4.2) with those of the past (see Tables 4.3-4.4), such as those
characterizing Tudor (1485-1603) and Stuart (1603-1714) England (see Raumolin-Brunberg
1996), the social orders are quite different, which inevitably affects the
configuration of independent socio-demographic variables (see also Conde-Silvestre and
Hernandez-Campoy 2004).

2.7 Standard ideology 

Both non-linguists' and linguists' views regarding language systems and their
evolution may be influenced by 'the standard ideology' - that is, the notion that
certain languages exist in standardized form (J. Milroy 1999, 2001). This notion
and surrounding beliefs about the superiority, correctness, clarity, etc. of
'standard' languages have, as Britain (2005) points out, become so entrenched in the way
we think that they have become conventionalized and taken for granted. In other
words, they have become language ideologies. Linguistic studies of the
development of non-standard dialects are not immune from these ideologies and indeed
have been made difficult because histories of languages, until recently, were
focused on the history of the standard variety. Non-standard dialects may
(unconsciously) be seen as implicitly 'derived from' and even 'perversions of' standard
dialects (Britain 2005), despite linguists' overt protestations that all dialects are
equally valid.






Table 4.3. Rank and status in Tudor (1485-1603) and Stuart (1603-1714)
England (from Raumolin-Brunberg 1996: 26; reproduced by permission of
Editions Rodopi B.V.)

Estate       Grade                                          Title

GENTRY       Nobility: Royalty; Duke, Archbishop;           Lord, Lady
             Marquess; Earl; Viscount; Baron, Bishop
             Gentry proper: Baronet (1611-), Knight         Sir, Dame
             Gentry proper: Esquire, Gentleman              Mr, Mrs
             Professions: Army Officer (Captain, etc.), Government Official
             (Secretary of State, etc.), Lawyer, Medical Doctor
             (Doctor), Merchant, Clergyman, Teacher, etc.

NON-GENTRY   Yeoman, Merchant, Husbandman                   Goodman, Goodwife
             Craftsman, Tradesman, Artificer                (Name of Craft:
                                                            Carpenter, etc.)
             Labourer, Cottager, Pauper                     none; (Labourer); (none)







Table 4.4. Social rank and the diffusion of incipient
standard spellings in fifteenth-century England
(Conde-Silvestre and Hernandez-Campoy 2005: 116;
reproduced by permission of Editum)

Social Position                 Informant

Upper Gentry                    Sir William Stonor
                                Sir John Paston II
                                Sir John Paston III
Lower Gentry                    Walter Elmes
                                Richard Germyn
Professionals (legal)           Thomas Mull
                                Richard Page
Urban Non-Gentry (merchants)    Thomas Betson
                                George Cely
                                Richard Cely II


In the case of the history of English, as claimed by Lass (1990: 245), "'historical
anglistics' has only been interested in the line to the southern British standard.
This reflects an old ethnocentrism: straight-line evolution to the southern received
standard (which is 'English') and side-paths of antiquarian or specialist interest
(leading to the 'dialects')." In reality, as J. Milroy (1999) points out, since
approximately the sixteenth century, the history of English has almost entirely been a history
of Standard English. Among the 'taken-for-granted
characteristics' of histories of English, he includes (J. Milroy 2001: 549; see also Chapter 31
in this Handbook):

English began in the fifth century AD when Germanic tribes settled in Britain 
English has had an unbroken and continuous history since that time and is the same 
language then as it is now 
English is not a mixed language 

One of the consequences of these distorted assumptions has been, on occasion,
considerable analytical inaccuracy, since historical linguists have
made claims about dialect developments which rely on these standard histories,
rather than examining the histories of the dialects themselves. For example, we
have the old-fashioned suggestion by many linguists (especially North American
and Irish linguists) that non-rhotic English accents have an /r/-deletion rule
rather than an /r/-insertion rule in certain prevocalic contexts. However, given
that intrusive /r/ is often inserted in intervocalic contexts in non-rhotic dialects
(e.g. the idear of it 'the idea of it') and that non-rhoticity is often transferred to
second languages, the /r/-deletion analysis is obviously incorrect from a
synchronic point of view, even though historically non-rhoticity clearly derived from
/r/ production. The assumption seems to be that since older forms of English and
current Standard American English have non-prevocalic /r/, other dialects derive
from this by 'deletion,' an assumption at least partially grounded in 'the standard
ideology'.

In Spanish, writers on Andalusian phonology have traditionally focused on
'deletion' of word-final consonants (except /m, n/) and postulated a five-vowel
system plus allophony in their dialectological descriptions. Their assumption
seems to be that since Standard Spanish has five vowels, southern varieties must
also have a five-vowel system. Nevertheless, Hernandez-Campoy and Trudgill
(2002) show that though consonant deletion is of course the correct diachronic
analysis, synchronically speaking, some southern varieties such as Murcian
Spanish and Andalusian Spanish do not delete syllable-final consonants but
instead do not have them in underlying representation. The diachronic loss of
syllable-final consonants had dramatic consequences for the Murcian vowel
system, leading to a current system consisting of eight vowels - not five vowels
and related allophones. Historical word-final /eC, oC, aC/ became /ɛ, ɔ, æ/,
respectively, and the same vowels then began to occur in certain cases
word-internally. Hence, an eight-vowel system has developed in conjunction with the
loss of syllable-final consonants. The current eight-vowel system of Murcian
Spanish is shown in Figure 4.2.



Figure 4.2. Vowel System of Murcian Spanish 


3. Conclusion 

Historical sociolinguistics has very often been regarded as limited by its lack of
representativeness and empirical validity. In this chapter, we have reviewed some
of the main methodological problems encountered when trying to practice
historical sociolinguistic research: i) representativeness, ii) empirical validity, iii)
invariation, iv) authenticity, v) authorship, vi) social and historical validity, and
vii) standard ideology. But, although nowadays the study of language change is
practically inconceivable without concurrent consideration of sociolinguistic
variation, theoretical and procedural limitations, controversies, and contradictions
also exist even within the apparently rigorous scientific field of variationist
sociolinguistics, as evidenced, for example, by the following factors:

(a) the unevenness with which the generalizability principle is applied or adhered 
to (see Bailey and Tillery 2004); 

(b) questions surrounding age, time, and language change, including the
limitations of apparent time studies of change and the difficulties of conducting
studies in real time (see Trudgill 1988; Eckert 1997; Bailey 2002);

(c) ethical issues in sociolinguistic fieldwork (see Labov 1982b; Labov 1984; 
L. Milroy 1987; Larmouth, Murray, and Murray 1992; Cameron, Frazer, Harvey, 
et al. 1992, 1997; Shuy 1993; Murray and Murray 1996; Josselson 1996; Paulston
and Tucker 1997; Wolfram and Fasold 1997; Johnstone 2000; Feagin 2002; 
Milroy and Gordon 2003); 

(d) stylistic variation (see Labov 1972, 2001; Bell 1984, 2001; Coupland 2001a, 
2001b, 2007; Johnstone 1996; Eckert 2000; Eckert and Rickford 2001; Rickford 
and McNair-Knox 1994; Schilling-Estes 2002, 2004; Cutillas-Espinosa and 
Hernandez-Campoy 2006, 2007; Cutillas-Espinosa, Hernandez-Campoy, and 
Schilling-Estes 2010; Hernandez-Campoy and Cutillas-Espinosa 2010); 

(e) differences in data-gathering procedures and subsequent results, including 
differences in sampling procedures, interviewer characteristics, and elicitation 
strategies (such as formal elicitation frames and loosely structured interviews) 
(see Bailey, Wikle, and Tillery 1997; Montgomery 1998; Bailey and Tillery 1999; 
Cukor-Avila and Bailey 2001; Bailey 2002; Chambers 2002; Bailey and Tillery 
2004); and 

(f) issues involved in data processing and analysis, including how spoken data 
are transformed when transcribed into written form (see Labov 1982a; Hildyard 
and Olson 1982; or Schneider 2002). 

Again, as far as historical sociolinguistics is concerned, we cannot ask of this
newer field what we cannot even ask of the more established field of variationist
sociolinguistics. Just because there are methodological problems, we should not
consider it an empirically invalid and inaccurate field of research. It is crucial in
those areas of study for which oral records are not available, especially when
studying long-term developments in language variation and change. Further,
historical sociolinguistics continues to advance, for example through
developments in information technology and sociohistorical study. Despite the limitations
mentioned earlier, written records such as The Helsinki Corpus of English Texts
(HC) provide us with cross-stylistic quantitative correlations (Nevalainen 1995;
Nevalainen and Raumolin-Brunberg 1989), while other text collections such as the
Corpus of Early English Correspondence (CEEC) provide us with correlations of
sociohistorical class stratification with linguistic variation (Nevalainen and
Raumolin-Brunberg 1996, 1998, 2003). The diversity of types of interaction and
styles reflected in collections of private correspondence, for example the Paston,
Cely, and Stonor Letters in late medieval England, is wider than that afforded
by official and literary documents, and these collections provide us with data for which
demographic information about authors can often be traced, hence supplying the data
necessary for the proper extension of sociolinguistic methods to historical
language states. Additionally, as some scholars show (see Eliason 1956; J. Milroy 1992;
Montgomery 1999; Miethaner 2000), written texts can in some ways reflect, though
only partially, pronunciation characteristics. Even in fairly inconsistent orthographic
systems, such as that of English, there are well-established sets of
grapheme-phoneme correspondences in writing.

In conclusion, and extending Robert Henry Robins' (1964/1980: 319) quote 
regarding linguistic studies of five decades ago to today's historical 
sociolinguistics: 

During these men's lifetimes and through their work we have witnessed striking and 
encouraging growth, both in the subject itself and in the scholarly interest taken in 
it. Much remains to be accomplished; new lines of thought open up, and new 
methods must be devised to follow them; and general linguistic theory must be 
always keeping pace with methodological progress. The languages of mankind in all 
their fascinating detail and with all their immense power among the human faculties 
still present a potentially limitless field for disciplined investigation and systematic 
study. It is altogether right that they should be the object of scholarly enthusiasm, 
controlled imagination, and great reverence. 

Historical sociolinguistics began in the early 1980s as a hybrid subfield drawing
on historical linguistics and sociolinguistics. Its work is being increasingly done
within the field of linguistics and has a growing importance in that field. Now,
maturity has been reached thanks to the efforts of a number of historical
sociolinguists who, over the past three decades, have devoted themselves to elucidating
the theoretical limits of the discipline, as well as to applying the tenets and
findings of contemporary sociolinguistic research to the interpretation of linguistic
material from the past.


REFERENCES 


Arnaud, R. (1980) Quelques observations
quantitatives 'en temps réel' sur un
changement: L'accroissement d'emploi de la
forme progressive dans la première moitié
du XIX siècle. Paper presented at the XXe
Congrès de la Société des Anglicistes de
l'Enseignement Supérieur, Poitiers.

Ayres-Bennett, W. (2001) Socio-historical
linguistics and the history of French.
Journal of French Language Studies 11:
159-77.

Bailey, G. (2002) Real and Apparent Time. In 
J.K. Chambers, P. Trudgill, and N. 
Schilling-Estes (eds.), pp. 312-32. 

Bailey, G. and Tillery, J. (1999) The Rutledge 
Effect: the impact of interviewers on 
survey results in linguistics. American 
Speech 74: 389-402. 






Bailey, G. and Tillery, J. (2004) Some sources 
of divergent data. In C. Fought (ed.), 
pp. 11-30. 

Bailey, G., Wikle, T., and Tillery, J. (1997)
The effects of methods on results in
dialectology. English World-Wide
18: 35-63.

Bauer, L. (2002) Inferring variation and 
change from public corpora. In J.K. 
Chambers, P. Trudgill, and N. Schilling- 
Estes (eds.), pp. 97-114. 

Bayley, R. (2002) The quantitative paradigm. 
In J.K. Chambers, P. Trudgill, and N. 
Schilling-Estes (eds.), pp. 117-41. 

Bell, A. (1984) Language style as audience 
design. Language in Society 13: 145-204. 

Bell, A. (2001) Back in style: Reworking 
audience design. In P. Eckert and J.R. 
Rickford (eds.), pp. 139-69. 

Bergs, A. (2005) Social Networks and Historical
Sociolinguistics: Studies in Morphosyntactic
Variation in the Paston Letters (1421-1503),
Mouton de Gruyter, Berlin.

Britain, D. (2005) Standard ideologies in the
history of English. Paper given at the 7th
Curso de La Variación Sociolingüística:
Enfoque Contrastivo, University of Murcia,
November 2005.

Cameron, D., Frazer, E., Harvey, P. et al.
(1992) Researching Language: Issues of Power
and Method, Routledge, London.

Cameron, D., Frazer, E., Harvey, P. et al.
(1997) Ethics, advocacy and empowerment
in researching language. In N. Coupland and
A. Jaworski (eds.), pp. 145-62.

CEEC = Corpus of Early English Correspondence
(1998) Compiled by T. Nevalainen, H.
Raumolin-Brunberg, J. Keranen, M. Nevala,
A. Nurmi, and M. Palander-Collin.
Department of English, University of
Helsinki, http://www.helsinki.fi/varieng/CoRD/corpora/CEEC/index.html
[accessed October 13, 2011].

Chambers, J.K. (2002) Patterns of variation 
including change. In J.K. Chambers, P. 
Trudgill, and N. Schilling-Estes (eds.), 
pp. 349-72. 

Chambers, J.K., Trudgill, P., and Schilling- 
Estes, N. (eds.) (2002) The Handbook of 
Language Variation and Change, Blackwell, 
Oxford. 


Conde-Silvestre, J.C. (2007) Sociolingüística
histórica, Gredos, Madrid.

Conde-Silvestre, J.C. and Hernandez- 
Campoy, J.M. (1998) An Introduction to the 
History of the English Language II: Middle 
and Modern English, Universidad de Murcia 
and Diego Marin, Murcia. 

Conde-Silvestre, J.C. and Hernandez- 
Campoy, J.M. (2004) A sociolinguistic 
approach to the diffusion of Chancery 
written practices in late fifteenth-century 
private correspondence. Neuphilologische 
Mitteilungen 105: 133-52. 

Conde-Silvestre, J.C. and Hernandez- 
Campoy, J.M. (2005) Sociolinguistic and 
geolinguistic approaches to the historical 
diffusion of linguistic innovations: 
incipient standardisation in Late Middle 
English. International Journal of English 
Studies 5.1: 101-34. 

Coulmas, F. (ed.) (1997) The Handbook of 
Sociolinguistics, Blackwell, Oxford. 

Coupland, N. (2001a) Dialect stylization in 
radio talk. Language in Society 30 (3): 345-75. 

Coupland, N. (2001b) Language, situation, 
and the relational self: theorizing dialect- 
style in sociolinguistics. In P. Eckert and 
J.R. Rickford (eds.), pp. 185-210. 

Coupland, N. (2007) Style: Language Variation
and Identity, Cambridge University Press,
Cambridge.

Coupland, N. and Jaworski, A. (eds.) (1997) 
Sociolinguistics: A Reader and Coursebook, 
Macmillan Press, London. 

Cukor-Avila, P. and Bailey, G. (2001) The 
effects of the race of the interviewer on 
sociolinguistic fieldwork. Journal of 
Sociolinguistics 5: 254-70. 

Cutillas-Espinosa, J.A. and Hernandez- 
Campoy, J.M. (2006) Non-responsive 
performance in radio broadcasting: a case 
study. Language Variation and Change 18 (3): 
317-30. 

Cutillas-Espinosa, J.A. and Hernandez-
Campoy, J.M. (2007) Script design in the
media: radio talk norms behind a
professional voice. Language and
Communication 27 (2): 127-52.

Cutillas-Espinosa, J.A., Hernandez-Campoy, 
J.M., and Schilling-Estes, N. (2010) 
Hyper-vernacularisation in a speaker 





design context: A case study. Folia 
Linguistica 44: 1-22. 

Davis, N.O. (ed.) (1971) Paston Letters and 
Papers of the Fifteenth Century (2 vols.). 
Clarendon, Oxford. 

Dees, A. (1971) Étude sur l'Évolution des
Démonstratifs en Ancien et en Moyen
Français, Wolters-Noordhoff, Groningen.

Eckert, P. (1997) Age as a sociolinguistic 
variable. In F. Coulmas (ed.), pp. 151-67. 

Eckert, P. (2000) Linguistic Variation as Social 
Practice, Blackwell, Oxford. 

Eckert, P. and Rickford, J.R. (eds.) (2001) Style
and Sociolinguistic Variation, Cambridge
University Press, Cambridge.

Eliason, N.E. (1956) Tarheel talk: An historical
study of the English language in North
Carolina to 1860, University of North
Carolina Press, Chapel Hill.

Feagin, C. (2002) Entering the community: 
fieldwork. In J.K. Chambers, P. Trudgill, 
and N. Schilling-Estes (eds.), pp. 20-39. 

Fought, C. (ed.) (2004) Sociolinguistic 
Variation: Critical Reflections, Oxford 
University Press, Oxford. 

HC = The Helsinki Corpus of English Texts
(1991) Compiled by M. Rissanen (Project
leader), M. Kyto (Project secretary); L.
Kahlas-Tarkka, M. Kilpio (Old English); S.
Nevanlinna, I. Taavitsainen (Middle
English); T. Nevalainen, H. Raumolin-
Brunberg (Early Modern English).
Department of English, University of
Helsinki, http://www.helsinki.fi/varieng/CoRD/corpora/HelsinkiCorpus
[accessed October 13, 2011].

Hernandez-Campoy, J.M. and Conde- 
Silvestre, J.C. (1999) The social diffusion of 
linguistic innovations in 15th century 
England: Chancery spellings in private 
correspondence. Cuadernos de Filologia 
Inglesa 8: 251-74. 

Hernandez-Campoy, J.M. and Cutillas- 
Espinosa, J.A. (2010) Speaker design 
practices in political discourse: a case 
study. Language and Communication 30: 
297-309. 

Hernandez-Campoy, J.M. and Trudgill, P. 
(2002) Functional compensation and 
Southern Peninsular Spanish /s/ loss. Folia 
Linguistica Historica XXIII: 31-57. 


Hildyard, A. and Olson, D.R. (1982)
On the comprehension of oral vs.
written discourse. In D. Tannen (ed.).
Spoken and Written Language: Exploring
Orality and Literacy, Ablex, Norwood, NJ,
pp. 19-33.

Johnstone, B. (1996) The Linguistic Individual: 
Self-Expression in Language and Linguistics, 
Oxford University Press, Oxford. 

Johnstone, B. (2000) Qualitative Methods in 
Sociolinguistics, Oxford University Press, 
Oxford. 

Josselson, R. (ed.) (1996) Ethics and Process in 
the Narrative Study of Lives, Sage, Thousand 
Oaks, CA. 

Kuhn, T.S. (1962) The Structure of Scientific 
Revolutions, University of Chicago Press, 
Chicago. 

Labov, W. (1966) The Social Stratification of
English in New York City, Center for
Applied Linguistics, Washington, DC.

Labov, W. (1972) Some principles of 
linguistic methodology. Language in Society 
1: 97-120. 

Labov, W. (1982a) Building on empirical 
foundations. In W.P. Lehmann and Y. 
Malkiel (eds.). Directions for Historical 
Linguistics, University of Texas Press, 
Austin, pp. 79-92. 

Labov, W. (1982b) Objectivity and 
commitment in linguistic science: The case 
of the Black English trial in Ann Arbor. 
Language in Society 11 (2): 165-201. 

Labov, W. (1984) Field methods of the project
on linguistic change and variation. In J.
Baugh and J. Sherzer (eds.). Language in
Use: Readings in Sociolinguistics, Prentice
Hall, Englewood Cliffs, NJ, pp. 28-66.

Labov, W. (1994) Principles of Linguistic 
Change, Volume I: Internal Factors, Blackwell, 
Oxford. 

Labov, W. (2001) The anatomy of style- 
shifting. In P. Eckert and J.R. Rickford 
(eds.), pp. 85-108. 

Larmouth, D.W., Murray, T., and Ross 
Murray, C. (eds.) (1992) Legal and Ethical 
Issues in Surreptitious Recording, University 
of Alabama Press, Tuscaloosa. 

Lass, R. (1984) Phonology. An Introduction to 
Basic Concepts, Cambridge University Press, 
Cambridge. 





Lass, R. (1990) Where do extra-territorial
Englishes come from? Dialect input and
recodification in transported Englishes.
In S. Adamson, V. Law, N. Vincent, and
S. Wright (eds.). Papers from the 5th
International Conference on English Historical
Linguistics, John Benjamins, Amsterdam,
pp. 245-80.

Miethaner, U. (2000) Orthographic 
transcriptions of nonstandard varieties: the 
case of earlier African American English. 
Journal of Sociolinguistics 4: 534-60. 

Milroy, J. (1992) Linguistic Variation and 
Change: On the Historical Sociolinguistics of 
English, Blackwell, Oxford. 

Milroy, J. (1999) The consequences of
standardisation in descriptive linguistics.
In T. Bex and R.J. Watts (eds.). Standard
English: The Widening Debate, Routledge,
London, pp. 16-39.

Milroy, J. (2001) Language ideologies and the 
consequences of standardisation. Journal of 
Sociolinguistics 5: 530-55. 

Milroy, L. (1987) Observing and Analysing 
Natural Language, Blackwell, Oxford. 

Milroy, L. and Gordon, M. (2003)
Sociolinguistics: Method and Interpretation,
Blackwell, Oxford.

Montgomery, M. (1998) Multiple modals in 
LAGS and LAMSAS. In M. Montgomery 
and T. Nunnally (eds.). From the Gulf States 
and Beyond: The Legacy of Lee Pederson and 
LAGS, University of Alabama Press, 
Tuscaloosa, pp. 90-122. 

Montgomery, M. (1999) Eighteenth-century 
Sierra Leone English: another exported 
variety of African American English. 
English World-Wide 20: 1-34. 

Murray, T.E. and Ross Murray, C. (1996) 
Under Cover of Law: More on the Legality of 
Surreptitious Recording, University of 
Alabama Press, Tuscaloosa. 

Nevalainen, T. (1995) Ongoing work on The 
Helsinki Corpus of Early English 
Correspondence. Paper presented at the 16th 
ICAME Conference, Toronto, May 23-28. 

Nevalainen, T. (1999) Making the best use of
'bad' data: evidence for sociolinguistic
variation in Early Modern English.
Neuphilologische Mitteilungen 100: 499-533.

Nevalainen, T. and Raumolin-Brunberg,
H. (1989) A corpus of Early Modern
Standard English in a socio-historical
perspective. Neuphilologische Mitteilungen
90 (1): 67-110.

Nevalainen, T. and Raumolin-Brunberg, H.
(eds.) (1996) Sociolinguistics and Language
History: Studies Based on the Corpus of Early
English Correspondence, Rodopi, Amsterdam.

Nevalainen, T. and Raumolin-Brunberg, H.
(1998) Reconstructing the social dimension
of diachronic language change. In R.M.
Hogg and L. van Bergen (eds.). Historical
Linguistics 1995. Vol II: Germanic Linguistics,
John Benjamins, Amsterdam, pp. 189-209.

Nevalainen, T. and Raumolin-Brunberg, H. 
(2003) Historical Sociolinguistics: Language 
Change in Tudor and Stuart England, 
Longman Pearson Education, London. 

Paulston, C.B. and Tucker, G.R. (eds.) (1997) 
The Early Days of Sociolinguistics: Memories 
and Reflections, Summer Institute of 
Linguistics, Dallas, TX. 

Raumolin-Brunberg, H. (1996) Historical 
sociolinguistics. In T. Nevalainen and H. 
Raumolin-Brunberg (eds.), pp. 11-37. 

Rickford, J.R. and McNair-Knox, F. (1994) 
Addressee- and topic-influenced style shift: 
A quantitative sociolinguistic study. In D. 
Biber and E. Finegan (eds.), Sociolinguistic 
Perspectives on Register, Oxford University 
Press, Oxford, pp. 235-76. 

Roberts, C. and Street, B. (1997) Spoken and
written language. In F. Coulmas (ed.),
pp. 168-186.

Robins, R.H. (1964/1980) General Linguistics. 
An Introductory Survey (3rd edn), Longman, 
London. 

Romaine, S. (1982) Socio-Historical Linguistics: 
Its Status and Methodology, Cambridge 
University Press, Cambridge. 

Romaine, S. (1988) Historical sociolinguistics: 
problems and methodology. In U. Ammon, 
N. Dittmar, and K.J. Mattheier (eds.). 
Sociolinguistics: An International Handbook of 
the Science of Language and Society (Vol. 2), 
Walter de Gruyter, Berlin, pp. 1452-1469. 

Sankoff, G. (1980) A quantitative paradigm 
for the study of communicative 
competence. In G. Sankoff (ed.). The Social 
Life of Language, University of Pennsylvania 
Press, Philadelphia, pp. 47-79. 

Schilling-Estes, N. (1998) Investigating 
'self-conscious' speech: the performance 





register in Ocracoke English. Language in 
Society 27: 53-83. 

Schilling-Estes, N. (2002) Investigating 
stylistic variation. In J.K. Chambers, P. 
Trudgill, and N. Schilling-Estes (eds.), 
pp. 375-401. 

Schilling-Estes, N. (2004) Constructing 
ethnicity in interaction. Journal of 
Sociolinguistics 8 (2): 163-195. 

Schneider, E. (2002) Investigating variation 
and change in written documents. In J.K. 
Chambers, P. Trudgill, and N. Schilling- 
Estes (eds.), pp. 67-96. 

Shuy, R. (1993) Risk, deception,
confidentiality, and informed consent.
American Speech 68: 103-106.

Stockwell, R.P. and Barritt, C.W. (1951)
Some Old English graphemic-phonemic
correspondences, Battenburg, Norman.

Stockwell, R.P. and Barritt, C.W. (1961) Scribal 
practice: some assumptions. Language 37: 
75-82. 

Toon, T.E. (1976) The variationist analysis of
Early Old English manuscript data. In
W.M. Christie Jr. (ed.). Proceedings of the
Second International Conference on Historical
Linguistics, North Holland, Amsterdam,
pp. 71-81.

Toon, T.E. (1983) The Politics of Early Old English 
Sound Change, Academic Press, New York. 

Trudgill, P. (1974) The Social Differentiation of 
English in Norwich, Cambridge University 
Press, Cambridge. 

Trudgill, P. (1988) Norwich revisited: recent
changes in an English urban dialect.
English World-Wide 9: 33-49.

Trudgill, P. (2003) Glossary of Sociolinguistics, 
Edinburgh University Press, Edinburgh. 

Williams, G. (1992) Sociolinguistics:
A Sociological Critique, Routledge,
London.

Wolfram, W. (2004) The sociolinguistic 
construction of remnant dialects. In C. 
Fought (ed.), pp. 84-106. 

Wolfram, W. and Fasold, R.W. (1997) Field
methods in the study of social dialects.
In N. Coupland and A. Jaworski (eds.),
pp. 89-115.



